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ABSTRACT 

The  scaling  technique  has  been  used  often  in  designing  efficient  algorithms  for  com¬ 
binatorial  optimization  problems.  This  paper  unifies  problem-specific  scaling  ap{»oaches 
into  a  linear  programming  framework.  Solution  of  linear  programs  by  scaling  involves  suc¬ 
cessive  solutions  of  what  we  call  the  tuning  problem.  This  tuning  problem  arises  when  one 
transforms  a  solution  for  the  previous  scale  into  a  solution  for  the  next  scale.  We  show  that 
the  tuning  problem  has  a  nice  format  under  certain  assumptions,  and  it  is  precisely  this  con¬ 
venience  which  has  led  to  fast  scaling  algorithms  for  many  combinatorial  problems.  We  also 
examine  schemes  that  use  a  relaxation  of  complementary  slackness,  and  we  show  that  one 
such  scheme  is  equivalent  to  scaling.  We  propose  a  generalization  of  an  approximation  algo¬ 
rithm  by  Gabow  and  Taijan  and  discuss  its  application  to  the  tuning  problem.  Finally,  we  dis¬ 
cuss  scaling  of  the  shortest  path  problem  and  the  weighted  matching  problem  in  our  linear 
programming  framework. 
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INTRODUCTION 


The  scaling  technique  has  been  effective  in  designing  efficient  algorithms  for  weighted 
combinatorial  problems  that  can  be  fonnulated  as  linear  programs  on  integer  polyhedra,  such 
as  shortest  s-t  path  and  weighted  matchings.  Edmcmds  and  Karp  [2]  presented  the  first  scaling 
algorithm  for  network  flow  problems.  Later  Gabow  [3]  [4]  proposed  scaling  algorithms  for  a 
variety  of  weighted  combinatorial  problems.  In  this  paper  we  unify  the  approaches  of  Gabow 
and  others  into  a  linear  programming  framework. 

The  typical  scaling  algorithm  works  as  follows.  Given  the  problem  P  with  an  integer 
vector  of  weights  w,  the  algorithm  recursively  solves  a  problem  P'  that  is  identical  to  prob¬ 
lem  P  but  with  w  replaced  by  [  .  Then  using  the  solution  to  /*'  it  produces  a  solution  to 

P .  Thus  If  this  step  takes  time  Tin),  where  n  is  a  measure  of  the  size  of  the  problem,  then  the 
algorithm  solves  P  in  time  Tf/t)  log  IV,  where  IV  is  the  largest  weight  in  the  problem. 

Scaling  may  be  seen  as  a  method  of  "combinatorializing"  an  optimization  problem. 
The  technique  is  not  as  powerful  as  the  Primal-Dual  algorithm:  While  the  Primal-Dual  algo¬ 
rithm  leads  to  a  series  of  {0,1}  weighted  problems,  the  scaling  algorithm  leads  to  (0,l,...3c} 
weighted  problems.  Nevertheless,  scaling  is  sufficiently  powerful  to  yield  efficient  algorithms 
with  sophisticated  data  structures. 

In  Section  1  we  first  present  a  linear  programming  (LP)  interpretation  of  scaling.  We 
discuss  the  operation  of  the  scaling  algorithm  on  an  integer  LP,  and  we  formulate  the  prob¬ 
lem  of  transforming  a  solution  to  P  into  a  solution  to  P'.  We  call  this  transformation  problem 
the  tuning  problem.  We  obtain  results  on  general  LPs  that  have  been  used  in  many  specific 
scaling  algorithms.  Using  these  results,  we  simplify  the  scaling  algorithm  of  Gabow  for 
weighted  general  matching  [4],  In  Section  2  we  present  an  analysis  of  Bertsekas’s  approxi- 
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mate  complementary  slackness  technique  [1]  and  show  its  equivalence  to  scaling.  We  also 
examine  successive  approximation  methods  for  solving  the  tuning  problem.  In  Section  3  we 
show  how  specific  scaling  algorithms  can  be  cast  in  the  general  framewoilc  of  Section  1.  An 
alternate  formulation  of  the  general  weighted  matching  algorithm  is  presented. 


-S**«iViVi^!lLj! 


CHAPTER  1 


SCALING  OF  LINEAR  PROGRAMS 


Let  A  be  a  matrix  and  b ,  c  be  vectors.  Consider  the  linear  program 


mmcx 


Ax=b 


(I.IA) 


Let  us  assume  that  we  have  solved  (I.IA)  above.  We  shall  examine  how  this  has  helped  us  in 


solving  the  perturbed  LP  (1.2A)  below. 


mincx-fdx 


Axszb 


(1.2A) 


If  the  vector  d  has  special  properties,  then  we  shall  see  that  solving  (I.IA)  has  made  the  task 
of  solving  (1.2A)  much  easier.  To  see  this,  let  us  consider  the  duals  of  (I.IA)  and  (1.2A), 


which  are  (I.IB)  and  (1.2B)  respectively. 


maxyf> 

yA^ 


max  yb 
yA£c+d 


(I.IB) 


(1.2B) 


Let  xo  be  an  optimal  solution  to  (I.IA)  and  yo  an  optimal  solution  to  its  dual  (I.IB).  Define 
e^c-yoA,  and  call  e  the  slack  vector  in  (I.IB).  By  (I.IB),  e^.  Note  that  by  complemen¬ 
tary  slackness  ex(f=0.  Now  substitute  yorfy  fory  in  (1.2B)  and  simplify  to  obtain  (1.3B) 
below. 


The  dual  of  (1.3B)  is  (1.3A)  below. 


maxyb 

yA<d-¥t 


min  (d-¥€)x 


Ax=b 


(1.3B) 


(1.3A) 


Let  us  examine  xo  as  a  tentative  solution  to  (1.3A).  Since  ex(f=0,  it  follows  that 
id+e)X(f=dxo.  At  this  point  let  us  assume  me  following: 

(i)  d  is  non-negative,  integral. 

(ii)  A,b,  and  c  are  integral. 

(iii)  Ax=b  defines  a  polyhedron  with  integral  vertices. 

(iv)  e  is  integral. 

Many  optimization  problems  satisfy  the  above  assumptions. 

Since  d  and  e  are  non-negative,  the  optimal  value  of  (1.3 A)  if  non-negative.  As  the 
cost  of  xq  is  dxQ,  xo  is  within  dxo  of  the  optimal  value.  Let  (x  i.y  i)  be  an  optimal  pair  of  solu¬ 
tions  to  (1.3A),(1.3B).  For  each  j,  if  ixi)j>0,  then  by  complementary  slackness  of  (xi.y  i)  we 


(yiA)j=(d+e)j. 

Consequently  ((yc&y\)A)j=(yoA)j+(y\A)j=(c-e)j+(d+e)j=(c+d)j.  Thus  we  see  that 
(jci.yo+yi)  exhibits  complementary  slackness  with  respect  to  (1.2A),(1.2B)  implying  that  the 
pair  is  optimal  for  (1.2A),(1.2B).  Therefore  a  solution  to  (1.3 A)  itself  is  a  solution  to  (1.2A). 
Also  note  that  since  yo^=cxo,  yo  a  feasible  solution  of  (1.2B)  and  is  within  dx^  of  the 
optimum. 
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Let  us  examine  the  problem  P  defined  below. 

min  gx  (g  is  integral) 

Ax=b  (1.4A) 

Assume  (ii)  and  (iii)  hold  for  P.  Define  mod  2  and  c=g-d.  Hence  =-j.  By 
definition,  d  is  0-1  and  c  is  a  vector  of  even  integers.  The  scaling  algorithm  first  solves  a 
problem  P '  by  changing  g  to  [  : 

min  yx  (y  is  integral) 

Ax=b  (1.5A) 

xSO 

Note  that  (1.5A)  satisfies  conditions  (ii)  and  (iii).  Suppose  we  solve  (1.5A)  recursively. 
^  (^i>yi)  is  an  optimal  solution  to  (1.5A)  and  its  dual,  then  (xi,2yi)  is  an  optimal  solution  to 

(1.1  A),(  1 . 1 B).  Therefore  the  slack  e  =c  -2y  \A  in  ( 1. 1 B)  is  twice  the  slack  -j— y  lA  in  the  dual 
of  (1.5 A).  Thus  if  the  entries  in  are  integral  multiples  of  (i.e.,  half-integers),  then  e  is 

integral,  and  (1.3A)  satisfies  conditions  (i)-(iv).  The  scaling  algorithm  then  has  to  solve 
(1.3A),(1.3B)  to  produce  a  solution  of  (1.4A).  Thus  we  have  broken  the  problem  P  into 
(I.IA)  and  (1.3A).  Problem  (I.IA)  is  equivalent  to  (1.5A),  and  the  largest  component  in  the 
cost  vector  of  (1.5A)  is  at  most  half  of  the  largest  component  in  (1.4A).  Working  under 
assumptions  (i)-(iv),  we  shall  examine  (1.3A),(1.3B)  closely. 

Let  xo  be  a  solution  to  (I.IA)  and  define  k=dxQ.  Since  Ax=b  has  integral  solutions,  k 
is  an  integer.  Then  "trim"  the  vector  d+e  in  (1.3 A)  as  follows.  Replace  all  entries  in  d+e  that 
are  greater  than  k  by  k-^\.  This  trimming  does  not  change  the  solutions  to  (1.3A)  because  we 
know  that  if  the  j  th  component  of  d+e  is  larger  than  k,  then  the  corresponding  Xj  cannot  be 
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positive  in  the  optimal  solution  because  the  optimal  value  of  (1.3A)  is  strictly  less  than  k+\. 
Alternatively,  delete  the  columns  in  A  conesponding  to  the  entries  greater  than  k  in  d+e  to 
get  a  reduced  (1.3A).  Thus  if  e  is  integral,  then  to  solve  (1.2A)  we  just  have  to  solve  the 
problem  below,  which  we  call  the  tuning  problem . 

min  (d+e)x  (d+e  is  trimmed) 

Ax=b  (1.6A) 

We  call  the  vector  d+e  the  offset .  Since  the  vector  d+e  can  take  values  from  the  finite  set 
{0.....k+l }  we  call  the  problem  semi-combinatorial  [7].  Efficiem  data  structures  may  then  be 
used  solve  the  new  (1.6A). 

In  summary,  we  see  that  the  problem  (1.4A)  can  be  broken  into  two  problems  that 
have  the  same  fonnat  as  (1.4A).  viz.  (1.5A)  and  (1.6A),  Problem  (1.5A)  may  be  solved  recur¬ 
sively.  Problem  (1.6A).  the  tuning  problem,  depends  on  (1.5A)  only  through  the  slack  vector 
for  the  solution  to  (1.5A).  The  solution  to  (1.6A)  is  also  the  solution  to  (1.4A).  The  optimal 
value  of  (1.6A)  is  bounded  by  dxo  where  xq  is  a  non-negative  solution  to  Ax=b. 

For  many  combinatorial  problems,  although  A,b,  and  c  are  integral,  the  components 
of  the  optimal  solutions  of  (1.1  B)  are  not  integral  but  are  integral  multiples  of  -j,  where  /  is 

an  integer.  In  Section  3  we  shall  examine  specific  LPs  with  /=2  and  1=4.  In  this  situation, 
instead  of  scaling  by  2,  scale  by  the  faaor  /.  TTiat  is,  given  the  problem: 

mingx 

Ax=b  (1.4A) 

x^ 

put  d=g  mod  /  and  c=g-d.  Now  (1.1  A)  is  equivalent  to  the  problem  (1.5C)  below,  which  is 
again  solved  recursively. 


mm  (yjx 

Acs*  (I.5C) 

As  j  too  is  integral.  A,  6,  and  the  cost  vector  y  are  all  integral  By  hypothesis,  the  optimal 
values  for  the  dual  variables  of  (1.5C)  are  multiples  of  -J-.  By  multiplying  the  dual  variables 

of  (1.5C)  by  I  we  obtain  an  optimal  solution  for  (I.IB),  which  must  be  integral.  Then  the 
slack  vector  e=c-yoA  is  integral  thus  ensuring  that  the  tuning  problem  satisfies  assumptions 
(i),  (ii),  and  (iii),  above. 

We  now  show  that  the  tuning  problem  (1.6A)  itself  caimot  be  solved  by  scaling.  For 

let  d+e=ci+di  where  the  components  of  dj  are  in  {0,1}  (respectively  in  {0 . i-1})  and  ci  is 

even  (respectively  divisible  by  /).  By  complementary  slackness,  if  (xo)y>0,  then 

making  ciX0=O.  Hence  the  new  scaled  down  problem 
(1.5Q  for  (1.6A)  has  zero  as  its  optimal  value,  xo  itself  being  a  solution. 

We  have  discussed  scaling  on  LPs  involving  minimization  of  a  cost  function.  If  the 
given  LP  is  a  maximizing  problem,  then  a  slight  modification  of  scaling  may  be  used.  Exam 
ine  problems  (1.7A)  and  its  dual  (1.7B),  and  (1.8),  (1.9)  below. 


max  wx 


'M 


Ax=b 


Ax=b 


For  convenience  we  refer  to  LP  (1.7A)  as  P(w),  (1.7B)  as  DP(yv),  (1.8)  as  PCf  and 
(1.9)  as  F([-^  ).  We  claim  that  the  scaled  down  problem  should  be  taken  as  F(r ^ ) 
instead  of  F  ( [  ^  )  for  the  following  two  reasons: 

(a)  If  P  is  a  maximizing  problem,  then  its  dual  DP  is  a  minimizing  proUem.  Thus  if 
is  a  solution  to  DF  ( [  ^  ),  then  lyo  need  not  even  be  feasible  for  DP  (w),  whereas  if  j  is 

a  solution  to  OF  ( f  ^  ).  then  2y  i  is  certainly  feasible  for  OF  (w ). 

(b)  The  slack  vector  e^-yoA  is  non-positive,  hence  it  is  useful  to  have  d  (as  in 
assumption  (i)  above)  non-positive.  Observe  that  in  the  case  of  scaling  on  a  minimization 
problem,  the  costs  of  intermediate  dual  solutions  at  succesive  scales  increase  monotonicaUy. 

In  the  case  of  a  maximization  problem,  choosing  F(  f  ^  )  as  the  scaled  down  problem  makes 
the  cost  of  intermediate  dual  solutions  decrease  monotonicaUy  ._ 

We  can  apply  the  observations  of  this  section  to  simpUfy  Gabow’s  algorithm  for  max¬ 
imum  cost  weighted  general  matching  [4].  Gabow  uses  F  d.  -^  )  as  the  scaled  down  problem. 
The  dual  variables  then  require  a  slight  adjustment  to  retain  feasibility  (see  (a)  above).  His 
algorithm  and  its  correctness  proof  may  be  simplified  by  choosing  P{\^  )  as  the  scaled 
down  problem.  This  follows  from  (a)  and  (b)  above.  Furthermore,  as  /=4  (i.e.,  the  dual 


variables  are  multiples  of  -^)  it  is  more  convenient  to  scale  by  4  than  by  2.  Consequently,  for 
a  simpler  algorithm,  /*  ( f  -^  )  may  be  chosen  as  the  scaled  down  problem. 
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CHAPTER  2 


APPROXIMATE  COMPLEMENTARY  SLACKNESS 

2.1  Bertsekas’s  Approximate  Complementary  Slackness.  Consider  the  linear  pro¬ 
gram  (2.1A)  and  its  dual  (2.1B). 

max  cx 

Ax=b  (2.1  A) 

min  yb 

yA^  (2.  IB) 

Let  j:°  be  feasible  in  (2.1A)  and  y®  be  feasible  in  (2.1B).  Define  the  slack  vector 
e^-y^A .  By  complementary  slackness,  the  pair  (x°,y®)  is  optimal  in  (2,1  A), (2. IB)  respec¬ 
tively  if  and  only  if  c°x°=0.  Restated,  (x®,y®)  is  optimal  if  for  all  J,  whenever  e;°>0, 

Beitsekas  [1]  introduced  the  notion  of  ’’approximate”  complementary  slackness  for  the 
min-cost  flow  problem.  Call  the  pair  of  feasible  solutions  (x*,y  *)  e-optimal  if  implies 
x/=0  for  all  j.  Note  that  a  0-optimal  solution  is  an  optimal  solution  to  (2.1  A), (2. IB)  because 
it  satisfies  the  complementary  slackness  conditions. 

Approximate  complementary  slackness  has  been  used  to  design  efficient  algorithms 
for  network  problems  [5,6].  For  a  given  problem,  it  is  first  proved  that  there  exists  an  Eo>0 
such  that  if  a  solution  is  eo-optimal,  then  it  is  0-optimaL  This  eo  is  specific  to  each  problem 
and  depends  on  the  values  the  dual  variables  can  take.  The  typical  algorithm  first  starts  with 
an  initial  guess  (x®,y®).  If  (x®,y°)  is  O-optimal.  then  the  algorithm  stops  since  (x^.y®)  are 
optimal.  If  the  solution  is  not  0-optimal,  then  there  must  exist  an  e  such  that  x®,y®  is  e- 


optimal:  For  example,  we  can  take  {cj-(y°A)j}.  This  e-optimal  solution  is  used  to 

generate  an  -|-optimal  solution,  which  is  then  used  to  generate  an  -l-optimal  solution,  and 

so  on,  until  finally  we  anive  at  an  eo-optimal  solution.  This  process  may  be  alternatively 
viewed  as  follows.  Consider  the  scaled  version  of  (2.1  A),  where  the  scaling  factor  is  2*.  i.e., 
the  new  cost  function  is  [ as  shown  below. 

min  L-^x 

Ai==b  (2.2A) 

Let  (xtO't)  be  an  optimal  solution  to  (2.2A)  and  its  dual.  Then  (Xk^^yk)  is  a  2*- 
optimal  solution  to  (2.1A)  because  (Xk,yk)  is  an  optimal  solution  to  (2.2A).  If  there  existed  a 
j  such  ttiat  (c-2*ytA)y>2*  but  (x*)y>0  ,  then  ^■^-ykA)j>0  but  (x*)y>0,  contradicting 

complementary  slackness  of  (Xk,y*)  in  (2.2 A).  Thus,  one  way  to  arrive  at  a  2*“‘ -optimal  solu¬ 
tion  would  be  to  solve  the  tuning  problem  for  (2.2A)  to  obtain  an  optimal  solution  for  the  LP 
with  cost  vector  of  [  .  This  process  is  continued  until  it  finally  arrives  at  a  such  that 

2**<eo  specified  previously. 

If  problem  (2.1)  satisfies  assumtions  (ii)  and  (iii)  of  Section  1,  and  the  dual  variables 
are  multiples  of  -|-  (/  is  an  integer),  then  the  components  of  the  slack  vector  are  integral  mul¬ 
tiples  of  -j.  Therefore  the  value  of  eo  is  precisely  where  /  is  also  the  scaling  factor  dis¬ 
cussed  in  the  Section  1 .  For  the  bipartite  matching  LP  eo=y ,  and  for  the  general  matching  LP 
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2  J  k-feasibility  and  k>optiinalhy.  Taijan  and  Gabow  [5]  applied  an  alternate  fonnu- 
lation  of  approximate  complementary  slackness  to  the  bipartite  matching  LP.  We  shall 
attempt  to  generalize  and  give  linear  programming  interpretations  of  their  formulation.  For 
the  LP  (2.1  A)  and  a  non-negative  vector  k ,  the  pair  of  vectors  (x*  is  called  k-feasible  if 

(i)  (x*)y>0  implies  (y*A)y=Cy 

(ii)  (x*)y=0  implies  (y*A  )j^j+kj 

Define  the  vector  x^  to  be  k-optimal  if  it  is  k-feasible  and  Xj^  is  feasible  in  its  LP 
(2.1  A),  i.e.,  Axk=b.  But  note  that  the  vectors  of  a  k-feasible  pair  need  not  even  be  feasible  in 
their  respective  LPs. 

Let  us  first  analyse  how  close  a  k-optimal  solution  X;^  is  to  the  optimal  value  for 
(2.1A).  First  note  that  ixk,yk)  is  feasiUe  in  the  following  LP  and  its  dual 

min  (c+k)x 

Ax=b  (2.3) 

x^ 

max  yb 
yA^+k 

Note  that 

(c  +k  )Xk-ykb  =  (c  +k  )x*-(yk  A  )x*  =  (c+k-ykA  )x* . 

By  k-optimality  of  (xt.y*),  if  (x*);>0  then  (ykA)j=cj.  Hence,  for  all  j, 

(c+k-ykA)j(,Xk)j^jixk)j-  Consequently,  (c+k)Xk-ykb^k-  This  implies  that  x*  is  within 

kxk  of  the  optimal  value  for  (2.3).  And  if  xq  is  an  optimal  solution  to  (2.1  A),  then  the  optimal 

value  for  (2.3)  is  at  most  kxo  away  from  that  of  (2.1  A).  Thus  x*  is  within  A:(xo+x*)  of  the 

optimal  value  of  (2.1  A)  and  X*  is  feasible  in  (2.1  A).  Define  I  Ixl  I  tobethe/i  norm  of  vector 

X.  If  all  the  solutions  of  Ax  =6  are  bounded,  i.e.,  I  lx  1 1^  for  all  solutions  of  Ax  =6,  then  x* 


k.»  'i.^  '.t.’  u  V  iL'  w  ^3  ».TK.  ■• 
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is  within  m^R  1 1  ib  1 1  of  the  optimal  value  for  (2.1  A). 

The  algorithm  to  achieve  this  k-optimal  solution  proceeds  as  follows:  First  the  cost 
function  c  in  (2.1A)  is  multiplied  by  m+l.  This  ensures  that  any  k-optimal  solution  to  this 
modified  (2.1  A)  is  also  optimal  for  it  A  starting  k-feasible  solution  xq  is  then  selected. 
Through  successive  refinements  this  k-feasible  solution  is  made  k-optimal.  Note  that  k- 
feasibility  is  more  relaxed  than  complementary  slackness  (or  0-optimality)  and  is  potentially 
easier  to  maintain. 

Now  what  remains  is  the  specification  of  the  refinemem  procedure.  This  refinement 
procedure  should  start  with  a  k-feasible  solution  (xi^i)  and  produce  another  k-feasible  solu¬ 
tion  that  is  either  k-qjtimal  or  "better"  than  (xuy  i). 

Here,  we  state  a  refinement  procedure  that  is  a  modified  relaxation  of  the  Primal-Dual 
algorithm.  It  may  be  interpreted  as  a  generalization  of  procedure  Match  of  [5].  Let  us  first 
review  the  Primal-Dual  algorithm  [7],  Given  an  LP  P,  its  dual  DP,  and  vector  ic  feasible  in 
DP,  the  Primal-Dual  algorithm  first  constracts  the  restricted  primal  RP  and  its  dual  DRP. 
Optimal  solutions  x'.rc'  for  RP  and  DRP  are  found,  and  k  is  upgraded  to  k-*^,  where  9  is 
chosen  appropriately.  Since  (x'.nT)  are  optimal  for  RP  and  DRP,  x'  and  tc+Otc'  satisfy  the 
complementary  slackness  condition.  Note  that  x'  itself  need  not  be  feasible  in  P.  The 
efficiency  of  the  Primal-Dual  algorithm  lies  in  this  inheritance  of  complementary  slackness, 
as  at  the  beginning  of  the  next  iteration  we  have  a  valid  candidate  solution  to  start  with. 

For  a  given  LP  P  as  in  (2.1A)  and  a  vector  x,  define  the  k-perturbed  LP  P{x)  with 
respect  to  X  to  be  the  dual  of  the  following  LP: 

max  yb 
yA^+5k 

where 


‘i^.  V  R  W  -  *i  -M* 


5;=0  if  Xj>0 
5j=l  if  Xj=C 

Note  that  if  (xic,yk)  is  k-feasible  for  P  and  its  dual,  then  (xit,yk)  satisfies  complementary 
slackness  conditions  with  respect  to  P  (x).  Also  note  that  while  xt  need  not  be  feasible  in  die 
primal,  yt  is  always  feasible  in  the  dual  ofP(x). 

We  now  state  procedure  Refine  below. 

Procedure  Refine 

{Input :  (xo,yQ)  which  is  k-feasible  in  P  and  its  dual 
Ouqnit :  (xio'  i)  k-feasible  or  k-optimal. } 

Step  0.  Find  the  restricted  primal  RP  with  respect  to  P  (xq). 

Step  1.  In  RP  find  a  k-optimal  solution,  if  one  exists.  Otherwise  find  a  "better"  k-feasible 
solution  xi.  Modify  the  dual  variables  from  yo  to  yi  such  that  xi  is  optimal  in  the  RP  with 
respect  to  P(xi). 

Step  2.  Update  the  dual  variables  as  in  the  Primal-Dual  algorithm  uruil  a  better  k-feasible 
solution  can  be  found.  □ 


In  Step  1.  if  a  k-optimal  solution  is  found,  then  this  solution  is  the  desired  k-optimal 
solution.  If  not,  then  the  dual  variable  modifications  ensure  that  (xi.y  i)  maintain  complemen¬ 
tary  slackness  in  the  RP  of  P(xi).  This  ensures  the  validity  of  Step  2,  because  complementary 
slackness  of  (xi.y  i)  after  Step  1  is  required  for  Step  2  to  work.  Step  1  involves  finding  a  vec¬ 
tor  yi  that  is  feasible  in  the  dual  of  P(xi)  and  maintains  complementary  slackness  with  xi. 
This  process  may  be  made  more  efficient  if  a  "better”  xi  is  found  in  a  systematic  way.  Step  2 
essentially  looks  for  an  optimal  solution  to  P(xi)>  and  therefore  unless  xi  is  k-optimal,  a 
bener  k-feasible  solution  can  always  be  found.  Thus  at  the  end  of  Step  2,  we  are  guranteed 
that  a  better  solution  can  indeed  be  found.  The  computational  convenience  of  the  algorithm  is 
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discussed  in  the  Section  2.3  . 

We  shall  compare  our  procedure  Refine  wiui  procedure  Match  in  [S],  which  solves 
the  weighted  bipartite  matching  i»oblem  for  the  graph  G  (YuViJE).  Define  1=(1,1,...,1).  Pro¬ 
cedure  Match  finds  a  1-optimal  matching,  thus  ifc=:(l,l...,l)  for  Match.  Let  yiv)  be  the  dual 
variable  associated  with  vertex  v  and  c(v,w)  be  the  cost  of  edge  (v.w).  If  is  a  matching 
(not  neccesarily  complete ),  then  an  edge  (v ,w)  is  called  eligible  if 

>(v)+y(w)=c(v,w)+{if(v,w)e  Af  then  0  else  1  }. 

Procedure  Match 

Step  1.  Find  a  maximal  seti4  of  vertex  disjoint  augmenting  paths  of  eligible  edges.  For  each 
path  P  e  A,  augment  the  matching  along  P,  arxl  for  each  vertex  we  Vip^,  decrease  y(w) 
by  1.  (This  makes  the  new  matching  M'  1-feasible.)  If  the  new  matching  Af'  is  complete, 
halt 

Step  2.  Do  a  Hungarian  search  to  adjust  the  duals  and  find  an  augmenting  path  of  eligible 
edges.  □ 

Note  that  the  set  of  eligible  edges  generates  precisely  the  RP  of  P(A/).  Step  1  of 
Match  is  equivalent  to  Step  1  of  Refine .  Finding  a  maximal  set  of  augmenting  paths  and 
decrementing  partially  the  dual  variables  ensure  that  the  new  matching  and  the  dual  variables 
maintain  complementary  slackness,  and  that  the  new  matching  is  optimal  in  the  RP  with 
respect  to  the  new  duals.  This  ensures  the  validity  of  the  Hungarian  search  of  Step  2.  Let  us 
look  at  Step  1  of  Match  more  closely.  In  particular,  we  shall  examine  the  notion  of  "better" 
k-feasible  solution  (Step  1  of  Refine )  in  relation  to  Match .  First  note  that  any  intermediate 
1-feasible  solution  xq  (in  Match)  satisfies  Acq^,  where  i4  are  firom  the  bipartite  matching 
LP  primal,  because  xo  always  represents  a  matching,  though  not  neccesarily  a  complete 
matching.  Thus  x\.  another  1-feasible  solution,  may  be  called  better  than  xq  if  the  slacks 
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satisfy  1 1  b-Axo  1 1  >  1 1  b-Axi  1 1 .  Since  the  matrix  A  for  the  bipartite  matching  LP  is  totally 
unimodular,  for  integral  intermediate  k-feasible  solutions  (xi,yi),  I  \  b-Ax\  \  I  is  integral. 
Thus  I  lb~Axil  I  is  a  good  measure  of  xi.  because  as  long  as  our  intennediate  solutions 
remain  integral,  the  measure  is  integral  and  decreases  at  every  execution  of  Refine .  We  call 
this  measure  the  slack  measure  of  goodness.  Consequently  if  in  Step  I  of  Refine  we  use  this 
notion  of  bettemess,  then  finding  a  better  k-feasible  solution  is  equivalent  to  finding  augment¬ 
ing  paths,  flows  etc.  for  standard  LPs  (matching,  network  flows  etc.). 

2,3  Procedure  Refine  and  The  Tuning  Problem.  In  this  subsection  we  shall  present  a 
subjective  analysis  of  the  application  of  Refine  to  the  tuning  problem  arising  from  scaling. 
We  noted  in  Section  1  that  the  tuning  problem  itself  cannot  be  simplified  by  further  scaling. 
We  also  noted,  however,  that  the  optimal  value  of  the  tuning  problem  is  bounded  by  I  lx*  1 1 , 
where  x«  is  the  solution  to  the  previous  scale  (assuming  we  are  scaling  by  2).  Let  us  apply 
Refine,  with  the  slack  measure,  to  the  tuning  problem  in  order  to  find  a  k-feasible  solution 
for  it.  As  the  tuning  problem  is  the  core  of  the  scaling  algorithm,  we  need  to  estimate  how 
efficient  Refine  is  on  the  tuning  problem. 

Let  us  analyse  the  number  of  iterations  required  for  Refine  to  arrive  at  a  k-feasible 
solution.  Hrst,  in  Step  1  we  see  that  the  tentative  k-feasible  solution  improves  in  slack,  i.e., 
1 1  b-Ax  1 1 1  >  1 1  b-AxQ  1 1 ,  while  in  Step  2  we  see  that  there  is  an  improvement  in  the  dual 
function  value.  Step  2  of  Refine  is  identical  to  that  of  the  Primal-Dual  algorithm,  and  hence 
this  improvement  is  guranteed.  Further,  we  also  know  that  the  optimal  value  to  the  problem 
cannot  exceed  I  lx*  1 1 .  Then  if  the  dual  variable  increments  are  directly  related  to  the  slack 
measure  I  Ib-Axil  I  and  the  dual  variable  modification  of  Step  1  does  not  substantially 
decrement  the  dual  variables,  then  Refine  converges  quickly  to  a  k-feasible  solution.  In  other 
words,  if  Step  1  is  slow  in  finding  k-feasible  solutions,  then  as  the  slack  measures  are  large. 
Step  2  increments  dual  variables  rapidly,  and  vice-versa.  But  the  cost  of  the  dual  solution  is 
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bounded  by  I  lx*  1 1  because  the  optimum  value  of  the  tuning  problem  is  bounded  I  lx*  1 1 . 
Hence,  at  every  iteration  of  Refine ,  either  the  slack  measure  decreases  or  the  cost  of  the  dual 
solution  approaches  its  upper  bound.  This  puts  an  upper  bound  on  the  number  of  iterations  of 
Refine  required.  The  situation  of  die  dual  variable  update  being  directly  related  to  the  slack 
measure  is  common  to  most  of  the  network  flow  LPs.  In  procediue  Match ,  for  example,  at 
each  execution  of  Step  2,  the  increase  in  the  cost  of  the  dual  variables  is  -^  no,  where  no  is 

the  number  of  unmatched  vertices  at  that  stage.  But  n0=l  \  b-Ax\  \  I,  the  measure  of  xi  in  the 
bipartite  matching  LP.  Thus  the  dual  variable  update  is  indeed  directly  related  to  the  good* 
ness  measure. 


CHAPTERS 


APPLICATIONS 

In  the  previous  sections  we  have  established  some  facts  about  scaling.  This  section 
shows  how  these  facts  have  been  used  by  otter  researchers  in  earlier  papers. 

3.1  Shortest  Paths  with  Non*Negative  Edge  Lengths.  For  a  directed  graph  G(y 
and  a  non-negative  integer  length  Uj  associated  with  each  arc  (i,j)  in  £,  the  shortest  path 
problem  is  to  find  directed  paths  of  minimum  length  from  a  distinguished  source  vertex  s  to 
all  otter  vertices  of  <7 .  Let  wH  ^  1-  We  shall  presem  Gabow’s  algorithm  [3]  for  the  problem 
and  show  that  it  is  an  instance  of  the  more  general  algorithm  of  scaling  of  linear  programs. 
Gabow  first  defines  a  "near-optimuffl''  solution  as  a  vector  p ,  with  one  entry  pi  for  each  ver¬ 
tex  i,  such  that: 

(i) p,=0. 

(ii)  p  dominates  edge  lengths,  that  is.  for  any  edge  ij ,  pi+lij’^j  where  ly^. 

(iii)  The  length  of  the  shortest  path  from  s  to  i  is  between  pi  and  p,  +m . 

Thus  Pi  is  an  estimate  of  the  distance  from  s  Wi. 

Let  P  be  our  original  problem  and  let  P'  be  the  scaled  down  problem  (weights  /y 

replaced  by  [  )•  Gabow  then  proceeds  as  follows:  Recursively  obtain  the  optimal  distance 

vector  Pi'  for  P'.  Define  p=2p'.  Then  p  is  a  near-optimum  distance  vector  for  P.  Using  this 
near-optimum  pi,  compute  modified  edge  lengths,  lij=lij-(pj-pi) .  Compute  shortest  paths 
on  G  with  the  modified  edge  lengths  li/  using  Dijkstra’s  algorithm.  The  shortest  paths  on 
this  modified  graph  are  the  shortest  paths  for  the  problem  P . 
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The  efficiency  of  the  algorithm  hinges  on  the  use  of  an  array  Q  ]  as  the  priority 
queue  in  Dijkstra’s  algorithm.  Q{k)  stores  vertices  at  tentative  distance  k.  From  property 
(iii)  above  of  a  near-optimal  solution,  the  distances  in  the  modified  graph  are  at  most  m ,  and 
this  implies  that  Q  suffices  as  the  priority  queue.  Since  solving  this  modified  problem  takes 
0(jn)  time,  F  can  be  solved  in  O (w  log //)  time,  where  IV  is  the  largest  weight  in  P ,  which 
is  the  largest  edge  cost 

Now  we  frame  the  shortest  path  problem  as  an  LP  and  interpret  Gabow’s  algorithm  in 
relation  to  the  results  from  Section  1.  The  shortest  path  LP  is  stated  below: 

min  YJijfij 

Af=b  (3.1  A) 

m 

where  6=s=(-(/j-l),l,l,..,l]^,  nH  V| ,  and  A  is  the  nxm  edge  incidence  matrix  with  the  first 
row  representing  the  source  s .  Its  dual  is: 

max 

pA<l  (3.  IB) 

Note  that  the  LP  in  (3.1  A)  satisfies  assumptions  (ii)  and  (iii)  of  Section  1.  From  the  dual  LP, 

we  see  that  the  p/s  in  Gabow’s  algorithm  are  the  dual  variables  of  the  shortest  path  LP  for 

P'.  The  modified  problem  of  Gabow  is  then  precisely  the  tuning  problem  (1.6A)  of  Section  1 

because 

lii=lij-(Pj-Pi)=A  +^1/  mod  2-{pj-pi) 

which  is  precisely  dij+eij,  the  offset  defined  in  Section  1.  Furthermore  the  solution  to  the  tun¬ 
ing  problem  (1.6A)  is  a  solution  to  F(1.4A),  ensuring  that  the  shortest  paths  on  the  modified 
graph  are  the  shortest  paths  on  the  original  graph,  and  thus  proving  the  correcmess  of 
Gabow’s  algorithm. 
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In  this  case  e  is  integral,  and  the  trimming  of  d+e  suggested  in  Section  1  guarantees 


the  effective  use  of  Q  as  the  priority  queue.  The  above  example  illustrates  that  the  tuning 


problem  may  sometimes  be  efficiently  solved  using  data  stnictures  that  utilize  the  integrality 


and  boundedness  of  the  offset 


32  Weighted  Bipartite  Matchings.  We  study  the  scaling  algorithm  of  Gabow  for 


weighted  bipartite  matching  [3].  This  algorithm  shows  how  scaling  applies  to  the  Primal- 


Dual  algorithm  for  solving  linear  programs. 


Let  G  (Vi,V2JE)  be  a  bipartite  graph,  where  V  i  and  Vi  are  vertex  sets  of  the  same  size 


and  £  is  the  edge  set  Given  an  integer  cost  function  aj  an  E,  the  problem  is  to  a  find  a  com¬ 


plete  matching  in  G  of  mimimum  cost  Let 


The  LP  of  the  weighted  bipartite  matching  problem  is  given  below. 


min 


. 


(3.2A) 


Xij^  for  alii,; 


The  dual  of  this  problem  is: 


for  all  i=l,...,/j,  ;=l,...,/i. 


(3.2B) 


We  note  that  for  bipartite  matchings  : 


(a)  The  bipartite  matching  LP  satisfies  assumptions  (ii)  and  (iii)  of  Section  1. 


(b)  In  the  Primal-Dual  algorithm,  at  every  step  the  value  of  the  optimizing  function 


increses  by  at  least  y . 
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(c)  If  all  the  edge  costs  are  integers,  then  the  dual  variables  are  multiples  of  -j. 

Points  (b)  and  (c)  above  may  be  proved  as  follows.  Let  us  define  the  parity  of  a 
number  as  1  if  it  is  an  odd  multiple  of  y,  0  if  it  is  an  even  multiple,  and  undefined  otherwise. 

Claim.  At  all  times  during  the  execution  of  the  Primal-Dual  algorithm  on  (3.2A)  and 
(3.2B),  all  the  dual  variables  have  the  same  parity. 

Proof.  By  induction  on  the  number  of  steps,  or  dual  variable  modifications.  The  initial 
values  of  the  dual  variables  are: 

a,=0  fori=l../i. 

As  aU  of  [cij }  are  integral,  all  of  {a, }  and  Oy )  are  of  parity  0. 

Recall  that  to  modify  the  dual  variables,  first  9=i.min(Cfy-<x,— Py)  is  calculated  over  a 

L  ij 

subset  of  the  edges.  Thus  if  all  the  dual  variables  have  the  same  parity,  then  the  parity  of 
(X,'+Py  is  0,  and  the  parity  of  6  is  well  defined.  At  every  step  the  dual  variables  are  modified  as 
follows:  0,-4— a,- ±9,  Py4— py±9.  All  the  modified  dual  variables  will  then  have  the  same  par¬ 
ity.  Thus  at  all  times  the  dual  variables  are  half-integers  and  (c)  above  is  true.  □ 

To  establish  (b),  the  increase  in  the  dual  objective  function  is  9;c'&  [7,  Section  5.2]), 
where  jc'  is  a  solution  to  the  dual  restricted  problem  DRP.  In  our  case  jc'  is  integral  and 

6 '=[1. 1... I F.  Thus  the  increase  must  be  an  integral  multiple  of  9,  which  is  at  least  y . 

Point  (c)  above  suggests  scaling  by  2.  In  this  subsection  and  the  next,  we  shall  use  y 
(with  or  without  subscripts)  to  refer  to  all  the  dual  variables  collectively.  If  (xi.yO  is  an 
optimal  solution  to  (1.5A)  and  its  dual,  then  {x\,2y\)  i.s  a  feasible  solution  to  (1.4A),(1.4B). 
Further,  from  (c)  we  see  that  (A:i,2yi)  is  integral.  Therefore  the  veaor  e  is  integral.  Then  the 
problem  (1.6A)  is  another  bipartite  matching  problem  with  integral  edge  costs.  Furthermore 
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the  optimal  value  of  this  problem  is  at  most  dxi=^.  Now  there  are  two  options.  Either  we 

could  start  with  2y  i  as  an  initial  choice  for  (1.2B)  and  woik  toward  the  optimum,  or  we  could 
solve  (1.6A},  in  both  cases  using  the  Primal-Dual  algorithm.  Because  of  (a),  in  either  case 
there  can  be  D(n)  dual  variable  updates  (steps).  Gabow  obtained  a  fast  algorithm  that  takes 
the  first  option.  That  there  are  only  0{.n)  steps  is  crucial  in  the  analysis. 

3J  Weighted  General  Matchings.  Given  a  complete  graph  G(V^)  and  an  integer 
cost  fimction  aj  on  £,  the  problem  is  to  find  a  complete  matching  of  minimum  cost.  Let 


n=^\. 


Consider  tte  general  matching  LP  and  its  dual  below  [7,  Section  11.3].  Suppose  there 


I S  I  —1 

!  N  odd-sets  and,  for  each  k  let  ^  ~  . 


min  'Z,CijXij 


1  i =!,...,« 


(3.3A) 


Z  ^ij  for  each  odd-set  S*  /fc=l,..JV. 

ijeSt 

Xij^ 


(3.3B) 


a,-MX/+  Z  Y*^i7  for  all 

ijeSt 

Y*^  for  all 


We  note  that 


(a)  The  general  matching  LP  satisfies  assumptions  (ii)  and  (iii)  of  Section  1. 

(b)  At  every  step  in  the  Primal-Dual  algorithm  the  value  of  the  objective  function 
increases  by  at  least 


A  ’i?. 


(c)  If  all  edge  costs  are  integers,  then  the  dual  variables  are  multiples  of  -|-  (i.e., 
quaiter-integeis). 

Statements  (b)  and  (c)  above  may  be  proved  as  follows.  Let  us  define  the  parity  of  a 
number  as  0  if  it  is  an  even  multiple  of  1  if  it  is  an  odd  multiple,  and  undefined  otherwise. 

The  parity  of  a  set  is  defined  only  if  all  the  elements  of  the  set  have  the  same  parity,  and  then 
the  parity  of  the  set  is  defined  to  be  the  parity  of  any  of  its  elements. 


Let  us  partition  V  into  two  sets,  M  and  X,  depending  on  the  current  matching  on  the 
graph:  M  contains  the  matched  veitices  and  X  the  exposed  ones.  Initially,  in  the  Primal-Dual 
algorithm,  V^,  and  the  algorithm  proceeds  by  successively  reducing  X.  Thus  once  a  vertex 
has  been  matched  it  will  remain  matched  (although  its  mate  may  change). 


An  edge  ((,;)  of  G  is  called  tight  with  respect  to  the  current  dual  variables  if 

a,+a;+  T  Y*=Cy  .  The  algorithm  constructs  the  current  or  tight  gn^Jb  Ge,  which  is  the 

iJwS^ 

graph  induced  by  the  tight  edges  at  that  stage  (7,  Section  11.3]  and  updates  the  previous 
matching.  In  this  process  some  veitices  of  X  may  become  matched.  For  this  to  happen,  these 
vertices  must  be  connected  to  some  vertices  in  Af  by  edges  in  Gc  (i.e.,  tight  edges). 

Claim.  Let  A/ ,  X  be  a  partition  of  V  as  defined  above.  Then 

(i)  All  dual  variables  a,-  for  ieM  have  the  same  parity. 

(ii)  All  Of  for  i  e  X  have  parity  0. 

(iii)  All  the  odd-set  variables  y* ,  have  parity  0. 

Note  that  parity  (A/)  need  not  be  the  same  as  parity  (X). 

Proof .  By  induction  on  the  number  of  steps  or  dual  variable  modifications.  The  initial 
solution  for  the  dual  variables  is: 
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It* , <■*. W  Ita* lU* 


>1 


a,=-j  min  Cy ,  for  all 

Since  initially  X^V  and  every  Cij  is  an  integer,  our  claim  is  true  for  the  initial  solution. 
Recall  that  the  dual  variables  are  modified  according  to  the  following  rule: 


5i=-jmin  fcy-ct,— ay  :(iJ)eT^xM}  (for  the  exact  definition  of  T  see  [7,  Section 


11.3]) 

52=min  ay  :ieM,jeX  } 

53^nin(^^) 

6=min(Si,  82, 83) 
ai<-a,±9forj6A/ 
a,<-ai  otherwise  (i.e.,  i  eX) 
y*<-y*±29. 

Let  Afi,X I  be  the  partition  before  a  step  and  A/2>^2  alter  the  step.  By  construction, 
M1SM2.  Assume  that  the  dual  variables  satisfy  the  claim  before  the  step.  Then  we  show  that 
the  dual  variables  satisfy  the  claim  after  the  step. 

If  M\=M2,  then  for  i,j6Mi(=^2).  since  pariry(a,)=pari/y(ay),  parity {<Xi-Hij)=0. 
Therefore  parity  {h\y=0.  Similarly  parity {?)2)=parity{M\)  because  for  jeXxi^Xi), 
parity  {(Xj)=0.  Qearly  83  is  also  a  quarter-integer.  The  update  rule  for  dual  variables  then 
makes  the  claim  true  for  the  new  values  of  dual  variables. 

If  Af  i*W2.  then  some  vertices  of  Xj  are  now  matched.  These  newly  matched  vertices 
must  be  connected  to  vertices  in  Wi  in  the  ti^t  graph  Cg.  Therefore  there  exist  JeM^r^Xi, 
and  ieMi  such  that  the  edge  (ij)  is  tight  Now  the  parities  of  ay,  y*  and  Cy  are  0.  Hence 
parity  {aiy=0=parity  {Mi).  Thus  8  will  be  a  quarter-integer  and  the  the  claim  will  hold  for  the 


25 


new  values  of  the  dual  variables.  □ 

So  we  see  that  at  all  times  during  the  execution  of  the  algorithm  the  dual  variables 
have  a  defined  parity  and  are  therefore  quarter>integers.  Statement  (b)  can  be  proved  in  a 
fashion  similar  to  that  for  bipartite  graphs. 

Statement  (c)  above  suggests  scaling  by  4.  The  scaled  primal  (3.4A)  and  its  dual 
(3.4B)  appear  below. 

{3,4A) 

S  +^k'^k  for/fc=l,...JV. 
ijeS* 

(3.4B) 

cii'+a/+  2  forallj.;^ 

y*'S0  foraU 

Thus  if  (xi.yi)  is  a  solution  to  (3.4A,B),  then  (xi,4y  i)  is  a  feasible  solution  to  (3.3A,B) 

a- 

which  is  within  dx\^-^  of  the  optimum.  At  this  point  if  we  should  start  with  4y  j  as  an  initial 

choice  for  (3.3B)  and  anempt  to  use  Edmonds’s  algorithm,  then  we  would  run  into  a  prob¬ 
lem.  Edmonds’s  algorithm  needs  the  following  invariants  to  be  true  at  all  stages  of  the  algo¬ 
rithm. 

I  The  Xy  are  0- 1 ,  and  they  form  a  matching  of  the  graph  induced  by  the  tight  edges. 


n  If  y*  <0  then  5*  is  full,  i.e.,  5*  has  r*  matched  edges. 


26 


I 

I 

I 

I 


niIfYy<O.Yk<Oand5yrvS*5fc0then5yCS*  or  5^25*. 

We  call  a  feasible  solution  tight  if  it  obeys  tiie  above  invariants.  Whereas  (xi,y  i)  is  tight  for 
(1.5A)  and  its  dual  (j:i,4yi)  may  i»t  be  tight  for  (1.4A)  and  its  dual  (1.4B).  Gabow’s  algo¬ 
rithm  [4]  circumvents  this  problem  by  modifying  (xiAyi)  to  another  feasible  solution  that  is 
tight  and  also  within  0(n)  of  the  optimal  value  of  the  matching. 

We  approach  the  problem  in  another  way.  Now  consider  a  solution  to  (3.4A3).  If  in 
the  dual  solution  some  odd-set  variables  Yi'  are  negative  (i.e.,  Yi'=YP<0).  then  the  slack  vector 
e  has  positive  entries  corresponding  to  the  negative  valued  odd-set  variables.  Then  the  tun¬ 
ing  problem  (3.SA)  and  its  dual  (3.SB)  are: 


min  "p:i/xij+^y9yk 


i  ™1 


2  Xij+  yk=^k  fork=l,...^ 

ijsSt 


Xij^  y*2K) 


(3.5A) 


max2ja,'+^r*Y*'  (3.5B) 

a,'+ay'+  2)  fk'^i/  forall/.y^ 

ijeS, 

for  all 

Thus  the  foimat  of  the  tuning  problem  is  different  from  hom  the  standard  matching  prob¬ 
lem.  The  crucial  diSerence  here  is  that  urtlike  the  general  matching  LP,  some  odd-set  vari¬ 
ables  have  positive  weights  in  the  cost  function.  The  combinatorial  meaning  of  this  is  that  the 
weighted  blossoms  (which  were  full)  in  the  elution  of  (3.4A)  are  to  be  made  as  full  as  possi¬ 
ble  in  the  tuning  problem.  We  shall  call  the  above  format  the  tuning  format  of  the  matching 
problem. 


» 


<'• 

k- 
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We  shall  adapt  Edmcmds’s  algorithm  to  this  format  of  the  matching  problem.  First 
extend  the  invariants  to  IMIMII’  as  shown  below. 

r  The  Xjy  are  0-1 .  and  they  form  a  matching  of  the  graph  induced  by  the  tight  edges. 

n*  If  yk'<yS  then  5*  is  full. 

nr  If  Y*'<yP.  and  SjnSk*0,  then SyC5*  or  Sj^Sk. 

Consider  the  initial  solution  to  (3.5B). 

a,'=:imin(Ciy-  Y,  1fiP) 

■A  J  ijeSt 

Yi'^ 

It  is  clear  that  the  above  solution  is  indeed  feasible  in  (3.5B).  We  shall  formulate  some  upper 
and  lower  bounds  for  the  cost  function  evaluated  on  this  solution. 

Let  the  scaled  down  LP  (3.4A)  refer  to  the  graph  G',  with  scaled  down  edge  costs. 
Suppose  the  problem  (3.4A)  has  been  solved  recursively,  hence  we  have  a  minimum  cost 
matching  of  G\  These  matched  edges  must  be  tight  in  the  LP  (3.4B).  This  implies  that  the 
slack  eij=0  for  every  matched  edge  in  G'.  This  in  turn  implies  that  in  (3.5B), 
C|y  mOd  4^3. 


ia.'+ij*Y*'=  s  («.+«/+  S  YP)^i-3=^. 

1^  /Hrt  ij  matched  in  G'  *  ^  ^ 

Thus  is  an  upper  bound  on  the  cost  of  an  optimal  solution  to  (3.SB).  Also  note  that 


o,'4n.infc,- 


Therefore 


av'.v»; 


A 
>  y: 
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Define  o^^'^yS-  From  Section  1  we  know  diat  the  optimal  value  of  the  tuning  problem 

(3.5A)  is  at  most  The  value  of  our  initial  solution  to  (3.5B)  is  at  least  -a. 

Next  we  show  that  Edmonds’s  algorithm  may  be  extended  to  the  tuning  format  of  the 
matching  problem.  Fust  define  y<  and  Ji,  as  follows. 

y.=/'(i.7)!  a.'+a>'+  X  yk=Cij'} 

ijes, 

Jh={Sk \yk'=^} 

The  Primal-Dual  algorithm  solves  the  tuning  fonnat  of  the  weighted  matching  problem  as 
follows:  Start  with  the  above  initial  dual  feasible  solution  and  then  perform  several  iterations 
of  solving  a  restricted  primal  (3.6A)  stated  below,  and  update  of  the  dual  solution  accord¬ 
ingly.  The  restricted  primal  (3.6A)  depends  on  the  sets  J, ,  Ji,  above. 


1=1 . n 

Z  Xij+Wk+xS^k=5it  k=l,...J^ 

ij«S, 

Xij  ^  (i  ,j  )  not  e  Jt  -*Xij  =0 

>v*SO  S*  not  e  /&— »y*=0 
The  dual  of  (3.6A)  is  (3.6B)  below. 


(3.6A) 


max 


Oli-HXj+  Z  (iJ)eJe 

ijeSt 

Y*^ 


(3.6B) 


.s' 

•  ■  y’' 

& 


'o 


>■  • 

y: 

J 

j»r  tA 

» 


f 

i4 

y  V 
V 
> 


m 

Li 


y  ‘ 


V>l*A'JllV 


0,^1  for  alii,  yt^  for  all /k 

Note  that  the  restricted  primal  and  dual  problems  for  the  standard  format  and  the  tuning  for¬ 
mat  are  identical.  Under  the  modified  invariants  IMI’  and  m',  Edmonds’s  algorithm  may  be 
used  to  obtain  an  optimal  solution  to  (3JA)  and  (3.SB).  The  next  question  is  the  time 
analysis  of  the  above  procedure  on  (3.5A).(3.5B).  First  note  that  the  cost  of  our  initial  solu¬ 
tion  to  (3.5B)  is  at  least  -<J.  After  the  trimming  procedure  of  Section  1  each  entry  in 


the  cost  vector  of  (3.5A)  can  be  made  less  than  -^+2.  Further,  since  the  yP’s  were  obtained 

ftom  the  previous  scale,  the  odd-sets  corresponding  to  these  variables  represent  a  valid  blos¬ 
som  stmcture.  This  implies  that  the  number  of  non-zero  yf  equals  the  number  of  blossoms  in 

the  previous  scale  and  thus  is  bounded  by  -j.  As  each  yP  is  less  than  -^+2,  o  is  O  (n^).  But 

the  optimal  value  of  (3.5B)  is  at  most  Thus  our  initial  solution  is  0(n^  away  from  the 

optimum. 

Next  we  attempt  to  bound  the  number  of  dual  variable  updates,  or  steps,  required  by 
the  modified  Edmonds’s  algorithm  starting  with  the  stated  initial  solution.  First  note  that 
0=min/'5i,  52,  63^  is  at  least  -I-.  If  at  any  stage  during  the  execution  of  the  algorithm  the 
number  of  exposed  vertices  is  / ,  then  the  value  of  the  cost  function  increases  by  9/  in  that 
step,  which  is  at  least 

Initially  all  vertices  are  exposed  and  finally  none  remain  exposed.  Let  ki  be  the 
number  of  steps  between  the  i  th  and  the  (i+l)st  augmentation.  Then  for  all  i  (See  [7], 
Sertion  1 1.3 ).  Each  augmentation  reduces  the  number  of  exposed  vertices  by  2.  Furthermore, 
we  know  that  the  total  increase  in  the  cost  function  is  0(n^).  Since  the  rth  augmentation 
increases  the  value  of  the  objective  function  by  n-2i,  finding  an  upper  bound  on  the  total 


.>i 
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number  of  steps  reduces  to  the  foilwing  problem. 

max  ki+kT+...  +k^ 

'^kiin-2i)=cn?- 
ki^  and  ki^  for  all  i. 

The  optimal  value  of  the  above  problem  is  easily  seen  to  be  O  (n  which  is  better  than 
0(n^),  our  previous  estimate.  This  is  going  to  reduce  the  number  of  steps  required  for 
Edmonds’s  algorithm. 


V  JmA,  A  A  Imi. 


CONCLUSIONS 


The  scaling  approach  has  been  widely  used  to  design  efficient  algorithms  for  specific 
problems.  The  highlight  of  this  paper  was  to  examine  scaling  from  a  more  general  framework 
of  linear  programs  (LPs)  and  to  present  a  uiufied  picture  of  scaling.  Thus  scaling  may  be  seen 
as  another  approach  to  solving  LPs.  We  showed  that  the  typical  scaling  algorithm  depends, 
for  its  efficiency,  on  (what  we  call)  the  tuning  problem.  The  tuning  problem  (TP)  is  the  prob¬ 
lem  of  translating  a  solution  to  the  previous  scale  into  a  solution  to  the  current  scale,  and  it  is 
the  only  problem  specific  part  of  the  algorithm.  The  TP,  although  similar  in  format  to  the  ori¬ 
ginal  problem,  is  much  simpler. 

Scaling  fails  on  the  tuning  problem.  Most  algorithms  use  problem-specific  approaches 
that  exploit  the  structure  of  the  TP.  Gabow  and  Tatjan  proposed  an  approximation  algorithm 
for  the  TP  that  arises  from  the  general  weighted  matching  problem.  We  generalised  this 
approach  to  some  extent  and  showed  its  relation  to  the  Primal-Dual  algorithm.  The  analysis 
of  the  above  approach  is  very  tedious,  and  we  feel  that  a  more  general  framework  is  required. 

Another  approach  to  the  solution  of  LPs  was  Bertsekas’s  relaxed  complementary 
slackness.  We  showed  that  this  may  be  seen  as  the  dual  of  the  scaling  approach,  i.e.,  scaling 
of  the  primal  problem  is  equivalent  to  Bertsekas’s  approach  to  the  dual  problem. 
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